SEQUENCE LISTING

(1) GENERAL INFORMATION:

(i) APPLICANT:

(A) NAME: LONZA AG

(B) STREET: Muenchensteinerstrasse 38

(C) CITY: Basle

(E) COUNTRY: Switzerland

(F) POSTAL CODE: 4002

(ii) TITLE OF INVENTION: Process for the preparation of (S)- or (R)-3,3,3-trifluoro-2-hydroxy-2-methylpropionic acid.

(iii) NUMBER OF SEQUENCES:

(iv) COMPUTER-READABLE FORM:

(A) MEDIUM TYPE: Floppy disk

(B) COMPUTER: IBM PC compatible

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30 (EPO)

(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1442 base pairs

(B) TYPE: Nucleotide

(C) STRANDEDNESS: Double

(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: Genomic DNA

(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO

(vi) ORIGIN:

(A) ORGANISM: Klebsiella oxytoca

(B) STRAIN: PRS1

(C) INDIVIDUAL/ISOLATE: PRS1

(vii) PROVENANCE:

(B) CLONE(S): pPRS2a

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: join(197..1181)

(D) OTHER INFORMATION: /product= "amidase"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

CCCGGGAACT CCATGTGGCC GTGATCCTGG TCGAGCAGGA TATTGCGATG ATCCAGCGGG 60

CCGCACAGCG CTGTGCGGTA ATGGATAAAG GCCTGGTTGT AGAAACGCTG ACCCAACAAC 120 

AGCTCTCTGA TGATCTTTTA ATGCGTCGTC ATCTGGCTCT GTAACTAAAC GCTATAAATT 180 

ACGTGGAGAA TAACAT ATG AAA TGG TTG GAA GAA TCC ATT ATG GCC AAA 229
Met Lys Trp Leu Glu Glu Ser Ile Met Ala Lys
1 5 10

CGC GGT GTT GGT GCC GGG CGT AAA CCG GTA ACG CAT CAC CTG ACG GAA 277
Arg Gly Val Gly Ala Gly Arg Lys Pro Val Thr His His Leu Thr Glu
15 20 25

GAA ATG CAA AAA GAG TTT CAT TAC ACC ATT GGC CCT TAT TCC ACA CCC 325 
Glu Met Gln Lys Glu Phe His Tyr Thr Ile Gly Pro Tyr Ser Thr Pro
30 35 40

GTC CTG ACC ATC GAA CCC GGT GAC CGG ATT ATT GTC GAC ACT CGA GAT 373
Val Leu Thr Ile Glu Pro Gly Asp Arg Ile Ile Val Asp Thr Arg Asp
45 50 55

GCT TTT GAA GGT GCT ATC AAT TCG GAA CAG GAT ATT CCG AGC CAG TTG 421
Ala Phe Glu Gly Ala Ile Asn Ser Glu Gln Asp Ile Pro Ser Gln Leu
60 65 70 75

CTA AAA ATG CCC TTT CTC AAC CCA CAA AAC GGA CCG ATC ATG GTC AAT 469 
Leu Lys Met Pro Phe Leu Asn Pro Gln Asn Gly Pro Ile Met Val Asn
80 85 90

GGC GCG GAG AAA GGT GAT GTG CTC GCT GTC TAT ATC GAA TCC ATG TTG 517
Gly Ala Glu Lys Gly Asp Val Leu Ala Val Tyr Ile Glu Ser Met Leu
95 100 105

CCC CGC GGC GTT GAT CCC TAC GGC ATC TGC GCC ATG ATT CCG CAT TTT 565
Pro Arg Gly Val Asp Pro Tyr Gly Ile Cys Ala Met Ile Pro His Phe
110 115 120 

GGC GGA CTG ACC GGG ACC GAC CTG ACG GCC ATG CTC AAT GAT CCG CTG 613 
Gly Gly Leu Thr Gly Thr Asp Leu Thr Ala Met Leu Asn Asp Pro Leu 
125 130 135 

CCA GAA AAG GTG CGC ATG ATT AAA CTC GAC AGT GAA AAG GTC TAC TGG 661 
Pro Glu Lys Val Arg Met Ile Lys Leu Asp Ser Glu Lys Val Tyr Trp
140 145 150 155

AGC AAA CGC CAT ACG CTT CCC TAT AAA CCC CAT ATT GGC ACC TTG AGC 709
Ser Lys Arg His Thr Leu Pro Tyr Lys Pro His Ile Gly Thr Leu Ser
160 165 170

GTA TCG CCA GAA ATT GAC TCA ATC AAT TCA CTG ACG CCA GAC AAT CAC 757
Val Ser Pro Glu Ile Asp Ser Ile Asn Ser Leu Thr Pro Asp Asn His
175 180 185

GGC GGG AAT ATG GAT GTG CCG GAT ATA GGA CCA GGG AGT ATT ACC TAT 805 
Gly Gly Asn Met Asp Val Pro Asp Ile Gly Pro Gly Ser Ile Thr Tyr
190 195 200

CTG CCG GTA CGT GCG CCT GGA GGC CGC CTG TTT ATT GGT GAT GCC CAT 853
Leu Pro Val Arg Ala Pro Gly Gly Arg Leu Phe Ile Gly Asp Ala His
205 210 215

GCT TGT CAG GGT GAT GGT GAG ATT TGC GGG ACC GCA GTA GAG TTT GCC 901
Ala Cys Gln Gly Asp Gly Glu Ile Cys Gly Thr Ala Val Glu Phe Ala
220 225 230 235

TCA ATC ACP ACC ATC AAA GTC GAT TTG ATC AAG AAC TGG CAG CTT TCC 949
Ser Ile Thr Thr Ile Lys Val Asp Leu Ile Lys Asn Trp Gln Leu Ser
240 245 250

TGG CCA CGA ATG GAG AAT GCC GAA AAT ATT ATG AGT ATT GGC AGT GCA 997
Trp Pro Arg Met Glu Asn Ala Glu Asn Ile Met Ser Ile Gly Ser Ala
255 260 265 



WO 98/01568 






PCT/EP97/03670 



CGT CCC CTG GAG GAT GCG ACG CGA ATT GCA TAT CGC GAC TTA ATT TAC 1045
Arg Pro Leu Glu Asp Ala Thr Arg Ile Ala Tyr Arg Asp Leu Ile Tyr
270 275 280

TGG CTG GTA GAA GAC TTT GGC TTC GAA CAA TGG GAT GCC TAC ATG CTT 1093
Trp Leu Val Glu Asp Phe Gly Phe Glu Gln Trp Asp Ala Tyr Met Leu
285 290 295

CTG AGT CAA TGC GGC AAA GTG CGG CTG GGC AAC ATG GTC GAC CCC AAA 1141
Leu Ser Gln Cys Gly Lys Val Arg Leu Gly Asn Met Val Asp Pro Lys
300 305 310 315

TAC ACC GTT GGC GCG ATG CTG AAC AAA AAC CTG TTA GTT TAGTAGGAAT 1190 
Tyr Thr Val Gly Ala Met Leu Asn Lys Asn Leu Leu Val 
320 325 

AACTAACCGG TGAACATTAC CCGGATGTAG ATCGGGGTAA TGTGTAAGTT CAAACAATCG 1250 

CTATTTTTAA CAGCTAAAGC AGGTGCATAT GGGGCCAGAT ACACCCATCA ATATTGGTTT 1310 

ACTTTACTCC TTCAGCGGAG TGACGGCGGC ACAAGAGTTG TCACAATGGC GCGGAGCAAC 1370 

CCAGGCTATT GCCGAAATTA ATCAAAATGG CGGCATCAAC GGCAGACCAC TCAATGCAAT 1430 

TCATTTGGAT CC 1442 
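
The running position counters and the amino acid line of SEQ ID NO: 1 can be cross-checked with a few lines of Python. The sketch below is illustrative only: the helper names `translate` and `last_base` are hypothetical and not part of the listing, and the standard genetic code is assumed. It translates the first eleven codons of the coding region and computes where a run of codons starting at position 197 ends.

```python
# Illustrative sketch only; helper names are hypothetical, not part of the listing.
BASES = "TCAG"
# Standard genetic code (assumed), codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string into one-letter amino acid codes."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def last_base(start: int, n_codons: int) -> int:
    """1-based position of the last base of n_codons codons beginning at start."""
    return start + 3 * n_codons - 1


# First eleven codons of the coding region (the row whose counter reads 229).
first_row = "ATG AAA TGG TTG GAA GAA TCC ATT ATG GCC AAA"

print(translate(first_row))  # MKWLEESIMAK
print(last_base(197, 11))    # 229, matching the row counter
print(last_base(197, 328))   # 1180, last base of the 328th codon
print(last_base(197, 329))   # 1183, last base if the stop codon is counted
```

The last two values give the end of the coding region without and with the stop codon. They are offered as a consistency check against the LOCATION field, not as a correction of it.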

(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 328 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Met Lys Trp Leu Glu Glu Ser Ile Met Ala Lys Arg Gly Val Gly Ala
1 5 10 15

Gly Arg Lys Pro Val Thr His His Leu Thr Glu Glu Met Gln Lys Glu
20 25 30

Phe His Tyr Thr Ile Gly Pro Tyr Ser Thr Pro Val Leu Thr Ile Glu
35 40 45

Pro Gly Asp Arg Ile Ile Val Asp Thr Arg Asp Ala Phe Glu Gly Ala
50 55 60

Ile Asn Ser Glu Gln Asp Ile Pro Ser Gln Leu Leu Lys Met Pro Phe
65 70 75 80

Leu Asn Pro Gln Asn Gly Pro Ile Met Val Asn Gly Ala Glu Lys Gly
85 90 95

Asp Val Leu Ala Val Tyr Ile Glu Ser Met Leu Pro Arg Gly Val Asp
100 105 110

Pro Tyr Gly Ile Cys Ala Met Ile Pro His Phe Gly Gly Leu Thr Gly
115 120 125

Thr Asp Leu Thr Ala Met Leu Asn Asp Pro Leu Pro Glu Lys Val Arg 
130 135 140 

Met Ile Lys Leu Asp Ser Glu Lys Val Tyr Trp Ser Lys Arg His Thr
145 150 155 160

Leu Pro Tyr Lys Pro His Ile Gly Thr Leu Ser Val Ser Pro Glu Ile
165 170 175

Asp Ser Ile Asn Ser Leu Thr Pro Asp Asn His Gly Gly Asn Met Asp
180 185 190

Val Pro Asp Ile Gly Pro Gly Ser Ile Thr Tyr Leu Pro Val Arg Ala
195 200 205

Pro Gly Gly Arg Leu Phe Ile Gly Asp Ala His Ala Cys Gln Gly Asp
210 215 220

Gly Glu Ile Cys Gly Thr Ala Val Glu Phe Ala Ser Ile Thr Thr Ile
225 230 235 240

Lys Val Asp Leu Ile Lys Asn Trp Gln Leu Ser Trp Pro Arg Met Glu
245 250 255

Asn Ala Glu Asn Ile Met Ser Ile Gly Ser Ala Arg Pro Leu Glu Asp
260 265 270

Ala Thr Arg Ile Ala Tyr Arg Asp Leu Ile Tyr Trp Leu Val Glu Asp
275 280 285

Phe Gly Phe Glu Gln Trp Asp Ala Tyr Met Leu Leu Ser Gln Cys Gly
290 295 300

Lys Val Arg Leu Gly Asn Met Val Asp Pro Lys Tyr Thr Val Gly Ala
305 310 315 320

Met Leu Asn Lys Asn Leu Leu Val
325

(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: not known

(D) TOPOLOGY: not known

(ii) MOLECULE TYPE: peptide

(vi) ORIGIN:

(B) STRAIN: PRS1

(C) INDIVIDUAL/ISOLATE: PRS1

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

Met Lys Trp Leu Glu Glu Ser Ile Met Ala Lys Arg Gly Val Gly Ala
1 5 10 15

Ser Arg Lys Pro 
20 
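
The numbers printed under each residue row follow a simple rule throughout this listing: residue 1 and every fifth residue are labelled, with sixteen residues per full row. A minimal sketch of that rule, applied to the two rows of SEQ ID NO: 3, is given below. The function name `ruler_ticks` is hypothetical, and the residue strings are typed in by hand for illustration.

```python
# Illustrative sketch only; ruler_ticks is a hypothetical helper name.
def ruler_ticks(first: int, count: int) -> list:
    """Positions labelled under a row: residue 1 and every multiple of 5."""
    last = first + count - 1
    return [p for p in range(first, last + 1) if p == 1 or p % 5 == 0]


# The two residue rows of SEQ ID NO: 3, entered by hand.
rows = [
    "Met Lys Trp Leu Glu Glu Ser Ile Met Ala Lys Arg Gly Val Gly Ala",
    "Ser Arg Lys Pro",
]
declared_length = 20  # value of the LENGTH field

position = 1
for row in rows:
    n = len(row.split())
    print(ruler_ticks(position, n))  # [1, 5, 10, 15] then [20]
    position += n

print(position - 1 == declared_length)  # True
```

The same check can be run against any other entry to compare its LENGTH field with the number of residues actually listed.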

(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: not known

(D) TOPOLOGY: not known

(ii) MOLECULE TYPE: peptide

(vi) ORIGIN:

(B) STRAIN: PRS1

(C) INDIVIDUAL/ISOLATE: PRS1

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Val Tyr Trp Ser Lys 
1 5 
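
Short peptides such as SEQ ID NO: 4 can be located within the full-length protein of SEQ ID NO: 2 by converting the three-letter codes to one-letter codes and searching for the substring. The sketch below is a minimal illustration under stated assumptions: `THREE_TO_ONE` and `to_one_letter` are hypothetical names, and only the residue row numbered 145 to 160 of SEQ ID NO: 2 is entered by hand rather than the whole sequence.

```python
# Illustrative sketch only; names are hypothetical, not part of the listing.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def to_one_letter(three_letter: str) -> str:
    """Convert space-separated three-letter codes to a one-letter string.

    Raises KeyError on any token that is not a standard code.
    """
    return "".join(THREE_TO_ONE[code] for code in three_letter.split())


# Residues 145-160 of SEQ ID NO: 2 and the peptide of SEQ ID NO: 4.
row_145_160 = "Met Ile Lys Leu Asp Ser Glu Lys Val Tyr Trp Ser Lys Arg His Thr"
peptide = "Val Tyr Trp Ser Lys"

row = to_one_letter(row_145_160)
pep = to_one_letter(peptide)
offset = row.find(pep)
position = 145 + offset if offset >= 0 else None

print(pep, position)  # VYWSK 153
```

Because the lookup raises `KeyError` on any unrecognised token, the same function also flags residue codes that are not valid three-letter abbreviations.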

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not known 

(D) TOPOLOGY: not known 

(ii) MOLECULE TYPE: peptide 

(vi) ORIGIN: 

(B) STRAIN: PRS1 

(C) INDIVIDUAL/ISOLATE: PRS1

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

Lys Pro Val Thr His His Leu Thr Glu Glu Met Gln Lys
1 5 10

(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 9 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: not known

(D) TOPOLOGY: not known

(ii) MOLECULE TYPE: peptide

(vi) ORIGIN:

(B) STRAIN: PRS1

(C) INDIVIDUAL/ISOLATE: PRS1

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Tyr Thr Val Gly Ala Met Leu Asn Lys 
1 5 

(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 14 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: not known

(D) TOPOLOGY: not known

(ii) MOLECULE TYPE: peptide

(vi) ORIGIN:

(B) STRAIN: PRS1

(C) INDIVIDUAL/ISOLATE: PRS1

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

Met Glu Asn Ala Glu Asn Ile Met Ser Ile Gly Ser Ala Arg
1 5 10

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not known

(D) TOPOLOGY: not known

(ii) MOLECULE TYPE: peptide

(vi) ORIGIN:

(B) STRAIN: PRS1

(C) INDIVIDUAL/ISOLATE: PRS1

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

Trp Leu Glu Glu Ser Ile Met Ala Lys
1 5 

(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 18 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: not known

(D) TOPOLOGY: not known

(ii) MOLECULE TYPE: peptide

(vi) ORIGIN:

(B) STRAIN: PRS1

(C) INDIVIDUAL/ISOLATE: PRS1

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

Met Pro Phe Leu Asn Pro Gln Asn Gly Pro Ile Met Val Asn Gly Ala
1 5 10 15

Glu Lys 

(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 19 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: not known

(D) TOPOLOGY: not known

(ii) MOLECULE TYPE: peptide

(vi) ORIGIN:

(B) STRAIN: PRS1

(C) INDIVIDUAL/ISOLATE: PRS1

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:

Asp Ala Phe Glu Gly Ala Ile Asn Ser Glu Gln Asp Ile Pro Ser Gln
1 5 10 15

Leu Leu Lys 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not known 

(D) TOPOLOGY: not known

(ii) MOLECULE TYPE: peptide

(vi) ORIGIN:

(B) STRAIN: PRS1

(C) INDIVIDUAL/ISOLATE: PRS1

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

Glu Phe His Tyr Thr Ile Gly Pro Tyr Ser Thr Pro Val Leu Thr Ile
1 5 10 15

Glu Pro Gly Asp Arg
20 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not known 

(D) TOPOLOGY: not known 

(ii) MOLECULE TYPE: peptide 

(vi) ORIGIN: 

(B) STRAIN: PRS1 

(C) INDIVIDUAL/ISOLATE: PRS1

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:

Leu Phe Ile Gly Asp Ala His Ala Glu Gln Gly Asp Gly Glu Ile Glu
1 5 10 15

Gly Thr Ala Val Glu Phe Ala
20

(2) INFORMATION FOR SEQ ID NO: 13:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 14 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: not known

(D) TOPOLOGY: not known

(ii) MOLECULE TYPE: peptide

(vi) ORIGIN:

(B) STRAIN: PRS1

(C) INDIVIDUAL/ISOLATE: PRS1

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:

Gly Asp Val Leu Ala Val Tyr Ile Glu Ser Met Leu Pro Arg
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 33 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not known 

(D) TOPOLOGY: not known 

(ii) MOLECULE TYPE: peptide 

(vi) ORIGIN: 

(C) INDIVIDUAL/ISOLATE: PRS1

(vii) PROVENANCE:

(B) CLONE(S): PRS1

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:

Gly Val Asp Pro Tyr Gly Ile Glu Ala Met Ile Pro His Phe v Gly
1 5 10 15

Leu Thr Gly Thr Asp Leu Thr Ala Met Leu Asn Asp Gln Leu Gln Pro
20 25 30

Lys



